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ABSTRACT 



One of the most crucial problems in theoretical and applied statistics is to 
determine the precision of the estimates produced by different statistical estimators. 
This problem is greatly increased when the population parametric characteristics are 
not known. Parallel to this problem is that of deciding how large (or small) the sample 
population must be in order to obtain a desired precision within certain range. 

There are several non-parametric methods to approach the first problem. The - 
BOOTSTRAP Method (Efron, 1979) is one of these approaches and the one of interest 
in this thesis. With this method, one could improve the precision of the estimates and 
gain information about the distributional characteristics of statistical estimators. The 
bootstrap method has been amply compared with other methods; the results show that 
the bootstrap method often produces more precise estimates (i.e. with smaller mean 
squared error) than competitors such as the JACKNIFE, SECTIONING and 
CROSS-VALIDATION. However, the results that have been obtained are based on 
large sample sizes and large numbers of "bootstrap" replications. 

This thesis analyzes the behavior of the BOOTSTRAP method when the number 
of bootstrap replications is small. It tries to identify any tradeoffs between sample size 
and the number of bootstrap replications required to attain a desired precision in the 
estimates produced in several particular situations. One of the goals is to produce 
graphical displays that will indicate to the experimental statistician the price that must 
be paid in the precision of the estimates, obtained with the bootstrap method, when 
sample size is small, and the number of bootstrap replications to use in this situation. 
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I. INTRODUCTION 



A. BACKGROUND 

One of the most common problem in applied statistics is the estimation of an 
unknown parameter 0. Once the statistician has decided on the model having one or 
more parameters to be estimated and has selected the estimator (i.e., m.l.e., least-square 
estimator, etc.) that will be used to obtain the estimates, the second problem that he or 
she faces is how to estimate the accuracy of these estimates. There are several ways of 
measuring the accuracy or the error of statistical estimators. In this thesis, the measure 
of statistical error will be defined to be the mean squared error (MSE) of the 
estimators; i.e. the variance plus the bias-squared of 0 h (where 0 h represents the 
estimator of the parameter 0. In Appendix A the reader will find a list of special 
notations used in this thesis) : 

MSE(0 h ) = E[(0 h - 0) 2 ] = Var(0 h ) + [BlAS(0 h )] 2 (1.1) 

When the practitioner is dealing with samples obtained from populations for 
which the distributional characteristics are known, classical statistical theory provides 
an answer to the second problem that the statistician faces. This is true since, at least 
in theory, the variance and the bias of most statistical estimators can be calculated 
analytically. However, the difficulty of analytically deriving the MSE of some statistical 
estimator increases as the mathematical definition of the estimator becomes more 
complicated. When this is the case or when the practitioner does not actually know the 
probability distribution, say F, from which the sample was obtained, then the MSE of 
the estimators must be estimated. 

There are several non-parametric methods for estimating the bias and the 
variance of an estimator of interest. The most common ones are the Quenoille-Tukey 
JACKNIFE method, CROSS-VALIDATION, and SECTIONING; the Jacknife being 
the most commonly used of the three approaches. Efron and Gong [Ref. 1] and Miller 
[Ref. 2] provide an excellent exposition of the first two methods and Lewis gives a good 
introduction and analysis of the later (See [Ref. 3] ). 
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